The TORGO database of acoustic and articulatory speech from speakers with dysarthria

نویسندگان

  • Frank Rudzicz
  • Aravind Kumar Namasivayam
  • Talya Wolff
چکیده

This paper describes the acquisition of a new database of dysarthric speech in terms of aligned acoustics and articulatory data. This database currently includes data from seven individuals with speech impediments caused by cerebral palsy or amyotrophic lateral sclerosis and ageand gender-matched control subjects. Each of the individuals with speech impediments are given standardized assessments of speech-motor function by a speech-language pathologist. Acoustic data is obtained by one head-mounted and one directional microphone. Articulatory data is obtained by electromagnetic articulography, which allows the measurement of the tongue and other articulators during speech, and by 3D reconstruction from binocular video sequences. The stimuli are obtained from a variety of sources including the TIMIT database, lists of identified phonetic contrasts, and assessments of speech intelligibility. This paper also includes some analysis as to how dysarthric speech differs from non-dysarthric speech according to features such as length of phonemes, and pronunciation errors.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Vocal tract representation in the recognition of cerebral palsied speech.

PURPOSE In this study, the authors explored articulatory information as a means of improving the recognition of dysarthric speech by machine. METHOD Data were derived chiefly from the TORGO database of dysarthric articulation (Rudzicz, Namasivayam, & Wolff, 2011) in which motions of various points in the vocal tract are measured during speech. In the 1st experiment, the authors provided a bas...

متن کامل

Production Knowledge in the Recognition of Dysarthric Speech

Production knowledge in the recognition of dysarthric speech Frank Rudzicz Doctor of Philosophy Graduate Department of Department of Computer Science University of Toronto 2011 Millions of individuals have acquired or have been born with neuro-motor conditions that limit the control of their muscles, including those that manipulate the articulators of the vocal tract. These conditions, collecti...

متن کامل

Assessment of Articulatory and Velopharyngeal Sub-systems of Dysarthric Speech

Dysarthria is a neuromotor impairment of speech that affects one or more of the speech sub-systems. It is reflected in the acoustic characteristics of the phonemes as deviations from their healthy counterparts. To capture these deviations, in this work a continuous speech, an isolated-style monophone-based, and a triphone-based speech recognition systems are developed. These speech recognition ...

متن کامل

Towards a noisy-channel model of dysarthria in speech recognition

Modern automatic speech recognition is ineffective at understanding relatively unintelligible speech caused by neuro-motor disabilities collectively called dysarthria. Since dysarthria is primarily an articulatory phenomenon, we are collecting a database of vocal tract measurements during speech of individuals with cerebral palsy. In this paper, we demonstrate that articulatory knowledge can re...

متن کامل

Learning mixed acoustic/articulatory models for disabled speech

This paper argues that automatic speech recognition (ASR) should accommodate dysarthric speech by incorporating knowledge of the production characteristics of these speakers. We describe the acquisition of a new database of dysarthric speech that includes aligned acoustics and articulatory data obtained by electromagnetic articulography. This database is used to train theoretical and empirical ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Language Resources and Evaluation

دوره 46  شماره 

صفحات  -

تاریخ انتشار 2012